The method as set forth in claim 1, wherein said files are located on corporate 
websites. 

The method as set forth in claim 1, wherein said files are located on magazine 
websites. 

The method as set forth in claim 1, wherein said files are located on newspaper 
websites. 

The method as set forth in claim 1, wherein said files are located on press 
release websites. 

The method as set forth in claim 1, wherein said files are located on 
professional websites. 

The method as set forth in claim 1, wherein said files are located on 
association websites. 

The method as set forth in claim 1, wherein said files are located using a 
publicly accessible search engine. 

The method as set forth in claim 1, wherein said files are located using a 
custom designed spider. 

The method as set forth in claim 1, wherein said files are located by selecting 
one or more links in said computer distributed system. 
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14. The method as set forth in claim 13, wherein said one or more links are 
selected based on their proximity to a set of keywords. 

The method as set forth in claim 1, wherein said files are located using a 
previously generated list of said files. 

The method as set forth in claim 1, further comprising the step of evaluating a 
tense related to said business data. 

17. The method as set forth in claim 16, wherein said business data is 
discarded based on said tense. 

The method as set forth in claim 1, wherein said step of locating comprises the 
step of using one or more tags to locate said files containing said business data. 

The method as set forth in claim 1, wherein said step of parsing comprises the 
step of using one or more tags to extract said business data. 

The method as set forth in claim 1, further comprising the step of creating a 
concordance table of said business data. 

21. The method as set forth in claim 20, further comprising the step of rating 
and bounding said business data. 
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22. The method as set forth in claim 1, wherein said step of parsing comprises the 
step of using inclusion and exclusion characteristics to extract said business 
data. 

23. The method as set forth in claim 1, further comprising the step of normalizing 
said business data. 

24. The method as set forth in claim 1, further comprising the step of eliminating 
duplicate sets of business data. 

25. The method as set forth in claim 1, further comprising the step of extracting 
date or time stamps of said files that contain said business data. 

26. The method as set forth in claim 25, further comprising the step of 
evaluating said date or time stamps of said files. 

27. The method as set forth in claim 25, further comprising the step of 
evaluating said date or time stamps with date or time stamps of 
previously extracted files. 

28. The method as set forth in claim 25, further comprising the step of 
updating said business data using said date or time stamps. 

29. A program storage device accessible by a computer, tangibly embodying a program of 
instructions executable by said computer to perform method steps for compiling 
business, said methods steps comprising: 
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(a) locating files within said distributed computer system that contain said business 
data; 

(b) parsing said files to extract said business data; and 

(c) transferring said extracted business data to an interested party. 

5 

30. The program storage device as set forth in claim 29, further comprising the 
step of evaluating said files containing said business data to determine a 
confidence level of finding a subset of said business data. 



10 31. The program storage device as set forth in claim 29, further comprising the 

C3 

*0 step of evaluating said files containing said business data to determine a 

confidence level of finding said business data. 



5 3 



r- 32. The program storage device as set forth in claim 29, further comprising the 

Q5 step of evaluating said files containing said business data to determine a 

fy confidence level of finding a set of keywords in said files containing said 

□ 

rg business data. 

33. The program storage device as set forth in claim 29, wherein said files are 

20 located on corporate websites. 



34. The program storage device as set forth in claim 29, wherein said files are 
located on magazine websites. 
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The program storage device as set forth in claim 29, wherein said files are 
located on newspaper websites. 
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36. The program storage device as set forth in claim 29, wherein said files are 
located on press release websites. 

5 37. The program storage device as set forth in claim 29, wherein said files are 

located on professional websites. 

38. The program storage device as set forth in claim 29, wherein said files are 
located on association websites. 

10 

Q 

*3 39. The program storage device as set forth in claim 29, wherein said files are 

\Q 

Jf located using a publicly accessible search engine. 

r A 40. The program storage device as set forth in claim 29, wherein said files are 

C35 located using a custom designed spider. 

fy 

p 41. The program storage device as set forth in claim 29, wherein said files are 

located by selecting one or more links in said computer distributed system. 

20 42. The program storage device as set forth in claim 41, wherein said one or 

more links are selected based on their proximity to a said of keywords. 

43. The program storage device as set forth in claim 29, wherein said files are 
located using a previously generated list of said files. 
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The program storage device as set forth in claim 29, further comprising the 
step of evaluating a tense related to said business data. 

45. The program storage device as set forth in claim 44, wherein said 
business data is discarded based on said tense. 

The program storage device as set forth in claim 29, wherein said step of 
locating comprises the step of using one or more tags to locate said files 
containing said business data. 

The program storage device as set forth in claim 29, wherein said step of 
parsing comprises the step of using one or more tags to extract said business 
data. 

The program storage device as set forth in claim 29, further comprising the 
step of creating a concordance table of said business data. 

49. The program storage device as set forth in claim 48, further comprising 
the step of rating and bounding said business data. 

The program storage device as set forth in claim 29, wherein said step of 
parsing comprises the step of using inclusion and exclusion characteristics to 
extract said business data. 

The program storage device as set forth in claim 29, further comprising the 
step of normalizing said business data. 
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52. The program storage device as set forth in claim 29, further comprising the 
step of eliminating duplicate sets of business data. 

5 53. The program storage device as set forth in claim 29, further comprising the 

step of extracting date or time stamps of said files that contain said business 
data. 

54. The program storage device as set forth in claim 53, further comprising 
10 the step of evaluating said date or time stamps of said files. 

H 55. The program storage device as set forth in claim 53, further comprising 

*£ the step of evaluating said date or time stamps with date or time stamps 

M= of previously extracted files. 

fy 56. The program storage device as set forth in claim 53, further comprising 

q the step of updating said business data using said date or time stamps. 

IF™ 

57. A computer program product, comprising: 
20 (a) business data compiled from files located in a distributed computer system, 

wherein said files are parsed to extract said business data; and 
(b) a computer readable medium that stores said extracted business data. 

58. The product as set forth in claim 57, wherein said business data is determined 
25 based on a confidence level of finding a subset of said business data in said 

files containing said business data. 
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The product as set forth in claim 57, wherein said business data is determined 
based on a confidence level of finding said business data in said files 
containing said business data. 

The product as set forth in claim 57, wherein said business data is determined 
based on a confidence level of finding a set of keywords in said files 
containing said business data. 

The product as set forth in claim 57, wherein said files are located on corporate 
websites. 

The product as set forth in claim 57, wherein said files are located on magazine 
websites. 

The product as set forth in claim 57, wherein said files are located on 
newspaper websites. 

The product as set forth in claim 57, wherein said files are located on press 
release websites. 

The product as set forth in claim 57, wherein said files are located on 
professional websites. 

The product as set forth in claim 57, wherein said files are located on 
association websites. 
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The product as set forth in claim 57, wherein said files are located using a 
publicly accessible search engine. 

The product as set forth in claim 57, wherein said files are located using a 
custom designed spider. 

The product as set forth in claim 57, wherein said files are located by selecting 
one or more links in said computer distributed system. 

70. The product as set forth in claim 69, wherein said one or more links are 
selected based on their proximity to a set of keywords. 

The product as set forth in claim 57, wherein said files are located using a 
previously generated list of said files. 

The product as set forth in claim 57, wherein said business data is extracted 
based on a tense. 

73. The product as set forth in claim 72, wherein said business data is 
discarded based on said tense. 

The product as set forth in claim 57, wherein one or more tags are used to 
locate said files containing said business data. 
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The product as set forth in claim 57, wherein one or more tags are used to 
parse said business data. 

The product as set forth in claim 57, wherein said business data is extracted 
based on a concordance table. 

77. The product as set forth in claim 76, wherein said one or more tags are 
associated with said business data. 

The product as set forth in claim 57, wherein business data is extracted using 
inclusion and exclusion characteristics. 

The product as set forth in claim 57, wherein said business data is normalized 
using one or more tags. 

The product as set forth in claim 57, wherein duplicate sets of business data are 
eliminated. 

The product as set forth in claim 57, wherein date or time stamps are extracted 
from said files. 

82. The product as set forth in claim 81, wherein said date or time stamps of 
said files are evaluated. 

83. The product as set forth in claim 81, wherein said date or time stamps are 
evaluated with date or time stamps of previously extracted files. 
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84. The product as set forth in claim 81, wherein said business data 
updated using said date or time stamps. 
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